000035_icon-group-sender _Mon Jan 30 09:46:17 1995.msg
Internet Message Format | 1995-02-09 | 2KB
Received: by cheltenham.cs.arizona.edu; Mon, 30 Jan 1995 03:33:39 MST
To: icon-group-l@cs.arizona.edu
Date: 30 Jan 1995 09:46:17 GMT
From: brady@cybernetics.net (Don Brady)
Message-Id: <3gich9$6oi@jabba.cybernetics.net>
Organization: Cybernetx, Inc.
Sender: icon-group-request@cs.arizona.edu
References: <1995Jan26.231752.14462@midway.uchicago.edu>, <keving.83.00E02940@primenet.com>, <1995Jan30.052611.25301@midway.uchicago.edu>
Subject: Re: Double-Byte Characters or Unicode?
Errors-To: icon-group-errors@cs.arizona.edu
I agree that I would not focus on Unicode alone.
The real requirement is to be able to process Japanese and
other languages with large character/symbol sets. I would
first support whatever representation schemes are
most widely used in the respective countries, since large
corpora of text data will already be stored in those
formats and will be difficult to convert.
You might or might not want to support DBCS, Shift-JIS, Unicode,
etc.
The ideal solution would be general enough to support all
of the above!
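
As a rough illustration of why these schemes are not interchangeable,
here is a minimal sketch in Python (not Icon) that encodes one short
Japanese string several ways. The sample text, the list of codecs, and
the helper names encode_all and decode are assumptions made up for
this example; a general solution would presumably need each piece of
data to carry or declare its encoding.

```python
# Hypothetical sample text: the three characters of "Nihongo"
# (the Japanese word for the Japanese language), written as escapes
# so that this source file itself stays plain ASCII.
TEXT = "\u65e5\u672c\u8a9e"

# Python codec names chosen for illustration only.
ENCODINGS = ["shift_jis", "euc_jp", "utf-16-be", "utf-8"]


def encode_all(text):
    """Return a dict mapping each codec name to the encoded bytes."""
    return {name: text.encode(name) for name in ENCODINGS}


def decode(data, encoding):
    """Decode bytes stored in a declared encoding into a string."""
    return data.decode(encoding)


if __name__ == "__main__":
    # Show the byte length and hex dump of each representation.
    for name, data in encode_all(TEXT).items():
        print(name, len(data), data.hex())
```

The same three characters come out as different byte sequences under
each scheme, so a program has to know which scheme a given file uses
before it can process the text reliably.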
I believe that IBM has now made DBCS support a requirement for all
of their major languages. IBM Smalltalk, Visual Age,
and PL/I already offer this support.
The most important thing is to be able to support run-time
data and dialogs in the formats in question. Programs could still be
written in conventional character sets, at least initially. I
believe this compromise is often the one chosen for other
languages or systems.
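
To make the compromise concrete, here is a small Python sketch (again
not Icon) in which the program text is plain ASCII while the run-time
data is Shift-JIS. The helper names is_lead_byte and split_sjis and
the sample MESSAGE are hypothetical, and the lead-byte ranges are the
commonly cited ones for Shift-JIS, so treat this as an illustration
rather than a complete DBCS implementation.

```python
def is_lead_byte(b):
    """True if b starts a two-byte Shift-JIS character.

    Assumes the commonly cited lead-byte ranges 0x81-0x9F and 0xE0-0xFC.
    """
    return 0x81 <= b <= 0x9F or 0xE0 <= b <= 0xFC


def split_sjis(data):
    """Split Shift-JIS bytes into one bytes object per character."""
    chars = []
    i = 0
    while i < len(data):
        # Take two bytes when a lead byte is followed by another byte.
        step = 2 if is_lead_byte(data[i]) and i + 1 < len(data) else 1
        chars.append(data[i:i + step])
        i += step
    return chars


# Hypothetical run-time data: ASCII "Icon " followed by one kanji
# (U+8868) whose second Shift-JIS byte is 0x5C, the ASCII backslash.
MESSAGE = "Icon \u8868".encode("shift_jis")

if __name__ == "__main__":
    print(len(MESSAGE), "bytes,", len(split_sjis(MESSAGE)), "characters")
    # A byte-oriented search finds a backslash that is not really there.
    print(b"\\" in MESSAGE, "\\" in MESSAGE.decode("shift_jis"))
```

A byte-at-a-time scan would miscount the characters here and would
also mistake the trailing byte of the kanji for a backslash, which is
the kind of problem that run-time DBCS support has to avoid.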
I agree that quite a bit of research will be needed to better
define the requirements (and implications for implementation).
You could look at what IBM, Microsoft, GNU, and
others have done for their languages.
I'd really love to see Icon available for processing Japanese,
etc. Otherwise I'm going to have to switch back to PL/I or
something!